Query-driven Data Integration (Short Paper)
نویسندگان
چکیده
The paper describes an ongoing project that pursues the idea of query-driven data integration. Instead of first creating a common global schema and fetching, transforming, and loading the data to be integrated, we start with the queries. They are taken as a specification of information need and thus as the overall purpose of integration. Two repositories are being developed, one for all information related to the queries and one for potential data sources to which those queries may refer. Queries may have very di↵erent forms, and thus there are many di↵erent ways how they can be used to make the integration e↵ort more e cient.
منابع مشابه
Information Extraction and Integration from Heterogeneous, Distributed, Autonomous Information Sources : A Federated Ontology-Driven Query-Centric Approach
This paper motivates and describes the data integration component of INDUS (Intelligent Data Understanding System) environment for data-driven information extraction and integration from heterogeneous, distributed, autonomous information sources. The design of INDUS is motivated by the requirements of applications such as scientific discovery, in which it is desirable for users to be able to ac...
متن کاملA Query Driven Method of Mapping from Global Ontology to Local Ontology in Ontology-based Data Integration
At present, the mediator/wrapper integration methods are widely used in ontology based data integration because they solve the data update problems of data warehouse method. The key of this method is building of mapping from the global ontology in mediator to the local ontology in wrapper. This article analyzes the general mapping methods and designs a SPARQL query driven Global Local as View (...
متن کاملIntegrating Biological Data and Tools with Bis
|The access and exploitation of integrated data repositories and applications is critical for life science. Biologists design protocols that typically rely on complex query pipelines accessing various biological electronic resources (data sources and tools) to consistute data sets for analysis and mining. Integration platforms are needed to allow biologists to acces, manipulate and analyze elec...
متن کاملQuality-oriented and Metadata-driven Integration in Information Grids
The goal of information grids is to provide a virtually integrated view on information, which is physically stored in many distributed nodes of the grid. A user should be able to query the grid through a uniform query interface using a common data model, without knowing the details of the distribution of the data. Information grids that integrate information from heterogeneous resources have to...
متن کاملUnstructured information integration through data-driven similarity discovery
Information integration from multiple heterogeneous sources is one of the major challenges facing enterprises and service providers today, and one of the important problems in this domain is the integration of structured and unstructured (or text) data. In this paper we describe our work on a data-driven approach to integrating various sources of text data, without relying on the availability o...
متن کامل